Abstract. This note describes non-asymptotic variance and tail bounds for 
order statistics of samples of independent identically distributed random vari- 
ables. Those bounds are checked to be asymptotically tight when the sampling 
distribution belongs to a maximum domain of attraction. If the sampling dis- 
tribution has non-decreasing hazard rate (this includes the Gaussian distribu- 
tion), we derive an exponential Efron-Stein inequality for order statistics: an 
inequality connecting the logarithmic moment generating function of centered 
order statistics with exponential moments of Efron-Stein (jackknife) estimates 
of variance. We use this general connection to derive variance and tail bounds 
for order statistics of Gaussian samples. Those bounds are not within the scope 
of the Tsirelson-Ibragimov-Sudakov Gaussian concentration inequality. Proofs 
are elementary and combine Renyi's representation of order statistics and the 
so-called entropy approach to concentration inequalities popularized by M. 
Ledoux. 



1. Introduction 

The purpose of this note is to develop non-asymptotic variance and tail bounds for order statistics. In the sequel, $X_1, \ldots, X_n$ are independent random variables, distributed according to some probability distribution $F$, and $X_{(1)} \ge X_{(2)} \ge \ldots \ge X_{(n)}$ denote the corresponding order statistics (the non-increasing rearrangement of $X_1, \ldots, X_n$). The cornerstone of Extreme Value Theory (EVT), the Fisher-Tippett-Gnedenko Theorem, describes the asymptotic behavior of $X_{(1)}$ after centering and normalization [2]. The median $X_{(\lfloor n/2 \rfloor)}$ is a widely used location estimator. Its asymptotic properties are well documented (see for example [11] for a review). Much less seems to be available if the sample size $n$ is fixed. The distribution function of $X_{(1)}$ is obviously explicitly known ($F^n$!), but simple and useful variance or tail bounds do not seem to be publicized.

Our main tools will be the Efron-Stein inequalities that assert that the jackknife estimate(s) of the variance of functions of independent random variables are on average upper bounds, and extensions of those inequalities that allow one to derive exponential tail bounds (see Theorem 2.1).

We refer to [51 [S] and references therein for an account of the interplay between 
jackknife estimates, order statistics, extreme value theory and statistical inference. 

The search for non-asymptotic variance and tail bounds for extreme order statis- 
tics is not only motivated by the possible applications of EVT to quantitative risk 
management, but also by our desire to understand some aspects of the concentra- 
tion of measure phenomenon [H [7] . Concentration of measure theory tells us that 
a function of many independent random variables that does not depend too much 
on any of them is almost constant. The best known results in that field are the 
Poincaré and Gross logarithmic Sobolev inequalities and the Tsirelson-Ibragimov-Sudakov tail bounds for functions of random Gaussian vectors. If $X_1, \ldots, X_n$ are independent standard Gaussian random variables, and $f : \mathbb{R}^n \to \mathbb{R}$ is $L$-Lipschitz, then $Z = f(X_1, \ldots, X_n)$ satisfies $\operatorname{Var}(Z) \le L^2$, $\log E[\exp(\lambda(Z - EZ))] \le \lambda^2 L^2/2$ and $P\{Z - EZ \ge t\} \le \exp(-t^2/(2L^2))$. If we apply those bounds to $X_{(1)}$ (resp. to $X_{(\lfloor n/2 \rfloor)}$), that is the maximum (resp. the median) of $X_1, \ldots, X_n$, the Lipschitz constant is (almost surely) $L = 1$, so the Poincaré inequality allows us to establish $\operatorname{Var}(X_{(1)}) \le 1$ (resp. $\operatorname{Var}(X_{(\lfloor n/2 \rfloor)}) \le 1$). This easy upper bound is far from being satisfactory: it is well-known in EVT that $\operatorname{Var}(X_{(1)}) = O(1/\log n)$ and $\operatorname{Var}(X_{(\lfloor n/2 \rfloor)}) = O(1/n)$. Naive use of off-the-shelf concentration bounds does not work when handling maxima or order statistics at large. This situation is not uncommon. The analysis of the largest eigenvalue of random matrices from the Gaussian Unitary Ensemble (GUE) [5] provides a setting where the derivation of sharp concentration inequalities requires ingenuity and combining concentration/hypercontractivity with special representations.
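The gap between the Poincaré bound and the true order of magnitude can be seen in a quick simulation. The following is only a minimal Monte Carlo sketch; the sample size, number of repetitions and seed are arbitrary illustrative choices, not values taken from the text.

```python
import random
import statistics

def order_stat_variances(n=200, reps=2000, seed=0):
    """Monte Carlo estimates of Var(max) and Var(median) for n standard Gaussians."""
    rng = random.Random(seed)
    maxima, medians = [], []
    for _ in range(reps):
        sample = sorted((rng.gauss(0.0, 1.0) for _ in range(n)), reverse=True)
        maxima.append(sample[0])             # X_(1), the maximum
        medians.append(sample[n // 2 - 1])   # X_(floor(n/2)), rank counted from the top
    return statistics.variance(maxima), statistics.variance(medians)

var_max, var_med = order_stat_variances()
print(f"Var(max) ~ {var_max:.3f}, Var(median) ~ {var_med:.4f}, Poincare bound = 1")
```

Both estimates should come out well below the bound 1, with the median estimate much smaller than the maximum estimate.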

Our purpose is to show that the tools and methods used to investigate the concentration of measure phenomenon are relevant to the analysis of order statistics. When properly combined with Renyi's representation for order statistics (see Theorem 2.5), the so-called entropy method developed and popularized by Ledoux [4] allows us to recover sharp variance and tail bounds. Proofs are elementary and parallel the approach followed in [5] in a much more sophisticated setting: whereas Ledoux builds on the determinantal structure of the joint density of the eigenvalues of random matrices from the GUE to upper bound tail probabilities by sums of Gaussian integrals that can be handled by concentration/hypercontractivity arguments, in the sequel, we build on Renyi's representation of order statistics: $X_{(1)}, \ldots, X_{(n)}$ can be represented as the image of the order statistics of a sample of the exponential distribution by a monotone function. The order statistics of an exponential sample turn out to be represented as partial sums of independent random variables.

In Section 2, using Efron-Stein inequalities and modified logarithmic Sobolev inequalities, we derive simple relations between the variance or the entropy of order statistics $X_{(k)}$ and moments of spacings $\Delta_k = X_{(k)} - X_{(k+1)}$. When the sampling distribution has non-decreasing hazard rate (a condition that is satisfied by Gaussian, exponential, Gumbel, logistic distributions, etc.; see Definition 2.6), we are able to build on the connection between the fluctuations of order statistics $X_{(k)}$ and spacings. Combining Proposition 2.3 and Renyi's representation for order statistics, we connect the variance and the logarithmic moment generating function of $X_{(k)}$ with moments of spacings. Theorem 2.9 may be considered as an exponential Efron-Stein inequality for order statistics.

In the framework of EVT, those relations are checked to be asymptotically tight (see Section 3).

In Section 4, using explicit bounds on the Gaussian hazard rate, we derive Bernstein-like inequalities for the maximum and the median of a sample of independent Gaussian random variables with correct variance and scale factors (Proposition 4.6). We provide non-asymptotic variance bounds for order statistics of Gaussian samples with the right order of magnitude in Propositions 4.2 and 4.4.

2. Order statistics and spacings 

Efron-Stein inequalities ([3]) allow us to derive upper bounds on the variance of functions of independent random variables.

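Theorem 2.1. (Efron-Stein inequalities.) Let $f : \mathbb{R}^n \to \mathbb{R}$ be measurable, and let $Z = f(X_1, \ldots, X_n)$. Let $Z_i = f_i(X_1, \ldots, X_{i-1}, X_{i+1}, \ldots, X_n)$ where $f_i : \mathbb{R}^{n-1} \to \mathbb{R}$ is an arbitrary measurable function. Suppose $Z$ is square-integrable. Then

$$\operatorname{Var}[Z] \le \sum_{i=1}^n E\left[(Z - Z_i)^2\right] .$$

The quantity $\sum_{i=1}^n (Z - Z_i)^2$ is called a jackknife estimate of variance.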
Efron-Stein inequalities form a special case of a more general collection of inequalities that encompasses the so-called modified logarithmic Sobolev inequalities.

Henceforth, the entropy of a non-negative random variable $X$ is defined by $\operatorname{Ent}[X] = E[X \log X] - EX \log EX$. The next inequality from [6] has been used to derive a variety of concentration inequalities [1].

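Theorem 2.2. (Modified logarithmic Sobolev inequality.) Let $\tau(x) = e^x - x - 1$. Then for any $\lambda \in \mathbb{R}$,

$$\operatorname{Ent}\left[e^{\lambda Z}\right] = \lambda E\left[Z e^{\lambda Z}\right] - E\left[e^{\lambda Z}\right] \log E\left[e^{\lambda Z}\right] \le \sum_{i=1}^n E\left[e^{\lambda Z} \tau(-\lambda(Z - Z_i))\right] .$$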

Theorems 2.1 and 2.2 provide a transparent connection between moments of order statistics and moments of spacings.

Henceforth, let $\psi : \mathbb{R} \to \mathbb{R}_+$ be defined by $\psi(x) = e^x \tau(-x) = 1 + (x - 1)e^x$.

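Proposition 2.3. (Order statistics and spacings.) For all $1 \le k \le n/2$,

$$\operatorname{Var}[X_{(k)}] \le k E\left[(X_{(k)} - X_{(k+1)})^2\right]$$

and for all $\lambda \in \mathbb{R}$,

$$\operatorname{Ent}\left[e^{\lambda X_{(k)}}\right] \le k E\left[e^{\lambda X_{(k+1)}} \psi(\lambda(X_{(k)} - X_{(k+1)}))\right] .$$

For all $n/2 < k \le n$,

$$\operatorname{Var}[X_{(k)}] \le (n - k + 1) E\left[(X_{(k-1)} - X_{(k)})^2\right]$$

and for all $\lambda \in \mathbb{R}$,

$$\operatorname{Ent}\left[e^{\lambda X_{(k)}}\right] \le (n - k + 1) E\left[e^{\lambda X_{(k)}} \tau(\lambda(X_{(k-1)} - X_{(k)}))\right] .$$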

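Proof. Let $Z = X_{(k)}$ and for $k \le n/2$ define $Z_i$ as the rank $k$ statistic from subsample $X_1, \ldots, X_{i-1}, X_{i+1}, \ldots, X_n$, that is $Z_i = X_{(k+1)}$ if $X_i \ge X_{(k)}$ and $Z_i = Z$ otherwise. Apply Theorem 2.1.

For $k > n/2$, define $Z_i$ as the rank $k - 1$ statistic from $X_1, \ldots, X_{i-1}, X_{i+1}, \ldots, X_n$, that is $Z_i = X_{(k-1)}$ if $X_i \le X_{(k)}$ and $Z_i = Z$ otherwise. Apply Theorem 2.1 again.

For $k \le n/2$, define $Z$ and $Z_i$ as before, apply Theorem 2.2:

$$\begin{aligned}
\operatorname{Ent}\left[e^{\lambda X_{(k)}}\right] &\le k E\left[e^{\lambda X_{(k)}} \tau(-\lambda(X_{(k)} - X_{(k+1)}))\right] \\
&= k E\left[e^{\lambda X_{(k+1)}} e^{\lambda(X_{(k)} - X_{(k+1)})} \tau(-\lambda(X_{(k)} - X_{(k+1)}))\right] \\
&= k E\left[e^{\lambda X_{(k+1)}} \psi(\lambda(X_{(k)} - X_{(k+1)}))\right] .
\end{aligned}$$

The proof of the last statement proceeds by the same argument. □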
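The choice of $Z_i$ made in the proof for $k \le n/2$ can be checked on a concrete sample: exactly $k$ of the leave-one-out statistics differ from $Z$, so the jackknife sum collapses to $k$ times the squared spacing. The sketch below uses hypothetical helper names and made-up data without ties.

```python
def kth_largest(values, k):
    """Rank-k order statistic X_(k) (k = 1 is the maximum)."""
    return sorted(values, reverse=True)[k - 1]

def efron_stein_sum(sample, k):
    """Sum over i of (Z - Z_i)^2, with Z = X_(k) and Z_i the rank-k statistic
    of the sample with X_i left out (the choice made for k <= n/2)."""
    z = kth_largest(sample, k)
    total = 0.0
    for i in range(len(sample)):
        reduced = sample[:i] + sample[i + 1:]
        total += (z - kth_largest(reduced, k)) ** 2
    return total

sample = [0.3, 2.5, -1.2, 1.7, 0.9, -0.4, 3.1, 1.1]   # illustrative data, no ties
k = 3
ordered = sorted(sample, reverse=True)
spacing = ordered[k - 1] - ordered[k]                 # X_(k) - X_(k+1)
print(efron_stein_sum(sample, k), k * spacing ** 2)
```

The two printed numbers agree up to rounding, which is the pathwise identity behind the first inequality of Proposition 2.3.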

Proposition 2.3 can be fruitfully complemented by Renyi's representation of order statistics (see [2] and references therein).

In the sequel, if $f$ is a monotone function from $(a, b)$ (where $a$ and $b$ may be infinite) to $(c, d)$, its generalized inverse $f^{\leftarrow} : (c, d) \to (a, b)$ is defined by $f^{\leftarrow}(y) = \inf\{x : a < x < b, f(x) \ge y\}$ (see [2] for properties of this transformation).

Definition 2.4. The $U$-transform of a distribution function $F$ is defined as a function on $(1, \infty)$ by $U = (1/(1 - F))^{\leftarrow}$, $U(t) = \inf\{x : F(x) \ge 1 - 1/t\} = F^{\leftarrow}(1 - 1/t)$.

Renyi's representation asserts that the order statistics of a sample of independent exponentially distributed random variables are distributed as partial sums of independent exponentially distributed random variables.

Theorem 2.5. (Renyi's representation.) Let $X_{(1)} \ge \ldots \ge X_{(n)}$ be the order statistics of a sample from distribution $F$, let $U = (1/(1 - F))^{\leftarrow}$, let $Y_{(1)} \ge Y_{(2)} \ge \ldots \ge Y_{(n)}$ be the order statistics of an independent sample of the exponential distribution, then

$$(Y_{(n)}, \ldots, Y_{(i)}, \ldots, Y_{(1)}) \sim \left(\frac{E_n}{n}, \ldots, \sum_{k=i}^n \frac{E_k}{k}, \ldots, \sum_{k=1}^n \frac{E_k}{k}\right)$$

where $E_1, \ldots, E_n$ are independent and identically distributed standard exponential random variables, and

$$(X_{(n)}, \ldots, X_{(1)}) \sim \left(U \circ \exp(Y_{(n)}), \ldots, U \circ \exp(Y_{(1)})\right) .$$

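We may readily test the tightness of Proposition 2.3. By Theorem 2.5, $Y_{(k)} = \frac{E_n}{n} + \ldots + \frac{E_k}{k}$ and $\operatorname{Var}[Y_{(k)}] = \sum_{i=k}^n \frac{1}{i^2}$. Hence, for any sequence $(k_n)_n$ with $\lim_n k_n = \infty$ and $\limsup k_n/n < 1$, $\lim_{n \to \infty} k_n \operatorname{Var}[Y_{(k_n)}] = 1$, while by Proposition 2.3, $\operatorname{Var}[Y_{(k)}] \le k E[(E_k/k)^2] = \frac{2}{k}$.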
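The comparison above can be reproduced numerically. The sketch below builds exponential order statistics from Renyi's partial sums and evaluates the exact variance against the Efron-Stein bound $2/k$; the function names and the chosen values of $n$ and $k$ are illustrative assumptions.

```python
import random

def exponential_order_stats_renyi(n, rng):
    """Order statistics Y_(1) >= ... >= Y_(n) of n standard exponentials,
    built as partial sums Y_(k) = sum_{i=k}^{n} E_i / i (Renyi's representation)."""
    e = [rng.expovariate(1.0) for _ in range(n)]
    y = [0.0] * n
    tail = 0.0
    for i in range(n, 0, -1):        # i = n, n-1, ..., 1
        tail += e[i - 1] / i
        y[i - 1] = tail              # y[k-1] holds Y_(k)
    return y

def exact_variance(n, k):
    """Var[Y_(k)] = sum_{i=k}^{n} 1 / i^2."""
    return sum(1.0 / (i * i) for i in range(k, n + 1))

n = 10_000
for k in (1, 10, 100, 1000):
    # k * Var[Y_(k)] versus k times the Efron-Stein bound 2/k
    print(k, k * exact_variance(n, k), 2.0)

y = exponential_order_stats_renyi(5, random.Random(1))
print(y)
```

For intermediate ranks the first column of values stays close to 1 while the bound contributes the constant 2, so the Efron-Stein estimate is off by a factor of about two at most in this example.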

The next condition makes combining Proposition 2.3 and Theorem 2.5 easy.

Definition 2.6. (Hazard rate.) The hazard rate of an absolutely continuous probability distribution with distribution function $F$ is $h = f/\overline{F}$, where $f$ and $\overline{F} = 1 - F$ are respectively the density and the survival function associated with $F$.

From elementary calculus, we get $(U \circ \exp)' = 1/h(U \circ \exp)$ where $U = (1/(1 - F))^{\leftarrow}$, which translates into

Proposition 2.7. Let $F$ be an absolutely continuous distribution function with hazard rate $h$, let $U = (1/(1 - F))^{\leftarrow}$. Then, $h$ is non-decreasing if and only if $U \circ \exp$ is concave.

Observe that if the hazard rate $h$ is non-decreasing, then for all $t > 0$ and $x > 0$, $U(\exp(t + x)) - U(\exp(t)) \le x/h(U(\exp(t)))$. Moreover, assuming that the hazard rate is non-decreasing warrants negative association between spacings and related order statistics.
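For the standard Gaussian distribution, both sides of the equivalence in Proposition 2.7 can be inspected numerically. The sketch below relies on Python's `statistics.NormalDist`; the grid of points is an arbitrary choice, and a finite grid only illustrates monotonicity and concavity, it does not prove them.

```python
import math
from statistics import NormalDist

STD = NormalDist()  # standard Gaussian

def hazard(x):
    """Hazard rate h(x) = phi(x) / (1 - Phi(x)) of the standard Gaussian."""
    return STD.pdf(x) / (1.0 - STD.cdf(x))

def u_exp(t):
    """U(exp(t)) for the standard Gaussian: the 1 - exp(-t) quantile, t > 0."""
    return STD.inv_cdf(1.0 - math.exp(-t))

grid = [0.5 + 0.25 * j for j in range(30)]          # t from 0.5 to 7.75
hazards = [hazard(u_exp(t)) for t in grid]
values = [u_exp(t) for t in grid]
# Second differences of U o exp on the grid; concavity makes them negative.
second_diffs = [values[j - 1] - 2 * values[j] + values[j + 1]
                for j in range(1, len(values) - 1)]
print(all(a <= b for a, b in zip(hazards, hazards[1:])), max(second_diffs))
```

The same helpers give a spot check of the increment bound $U(\exp(t+x)) - U(\exp(t)) \le x/h(U(\exp(t)))$, for instance with $t = 2$ and $x = 1$.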

Proposition 2.8. If $F$ has non-decreasing hazard rate, then the $k$th spacing $\Delta_k = X_{(k)} - X_{(k+1)}$ and $X_{(k+1)}$ are negatively associated: for any pair of non-decreasing functions $g_1$ and $g_2$,

$$E[g_1(X_{(k+1)}) g_2(\Delta_k)] \le E[g_1(X_{(k+1)})] E[g_2(\Delta_k)] .$$
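
Proof. Let $Y_{(n)}, \ldots, Y_{(1)}$ be the order statistics of an exponential sample. Let $E_k = Y_{(k)} - Y_{(k+1)}$ be the $k$th spacing of the exponential sample. By Theorem 2.5, $E_k$ and $Y_{(k+1)}$ are independent. Let $g_1$ and $g_2$ be two non-decreasing functions. By Theorem 2.5,

$$\begin{aligned}
E[g_1(X_{(k+1)}) g_2(\Delta_k)] &= E\left[g_1(U(e^{Y_{(k+1)}})) g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}}))\right] \\
&= E\left[E\left[g_1(U(e^{Y_{(k+1)}})) g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}})) \mid Y_{(k+1)}\right]\right] \\
&= E\left[g_1(U(e^{Y_{(k+1)}})) E\left[g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}})) \mid Y_{(k+1)}\right]\right] .
\end{aligned}$$

The function $g_1 \circ U \circ \exp$ is non-decreasing. Almost surely, as the conditional distribution of $kE_k$ with respect to $Y_{(k+1)}$ is the exponential distribution,

$$E\left[g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}})) \mid Y_{(k+1)}\right] = \int_0^\infty e^{-x} g_2\left(U(e^{x/k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}})\right) dx .$$

As $F$ has non-decreasing hazard rate, $U(\exp(x/k + y)) - U(\exp(y)) = \int_0^{x/k} (U \circ \exp)'(y + z)\,dz$ is non-increasing with respect to $y$.

This entails that $E\left[g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}})) \mid Y_{(k+1)}\right]$ is a non-increasing function of $Y_{(k+1)}$. Hence, by Chebyshev's association inequality,

$$\begin{aligned}
E[g_1(X_{(k+1)}) g_2(\Delta_k)] &\le E\left[g_1(U(e^{Y_{(k+1)}}))\right] \times E\left[E\left[g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}})) \mid Y_{(k+1)}\right]\right] \\
&= E\left[g_1(U(e^{Y_{(k+1)}}))\right] \times E\left[g_2(U(e^{E_k + Y_{(k+1)}}) - U(e^{Y_{(k+1)}}))\right] \\
&= E[g_1(X_{(k+1)})] \times E[g_2(\Delta_k)] .
\end{aligned}$$

□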

Negative association between order statistics and spacings allows us to establish 
our main result. 



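Theorem 2.9. Let $X_1, \ldots, X_n$ be independently distributed according to $F$, let $X_{(1)} \ge \ldots \ge X_{(n)}$ be the order statistics and let $\Delta_k = X_{(k)} - X_{(k+1)}$ be the $k$th spacing. Let $V_k = k\Delta_k^2$ denote the Efron-Stein estimate of the variance of $X_{(k)}$ (for $k = 1, \ldots, n/2$).

If $F$ has non-decreasing hazard rate $h$, then for $1 \le k \le n/2$,

$$\operatorname{Var}[X_{(k)}] \le E V_k \le \frac{2}{k} E\left[\left(\frac{1}{h(X_{(k+1)})}\right)^2\right] ,$$

while for $k > n/2$,

$$\operatorname{Var}[X_{(k)}] \le \frac{2(n - k + 1)}{(k - 1)^2} E\left[\left(\frac{1}{h(X_{(k)})}\right)^2\right] .$$

For $\lambda \ge 0$, and $1 \le k \le n/2$,

$$\log E e^{\lambda(X_{(k)} - EX_{(k)})} \le \lambda \frac{k}{2} E\left[\Delta_k\left(e^{\lambda\Delta_k} - 1\right)\right] = \lambda \frac{k}{2} E\left[\sqrt{V_k/k}\left(e^{\lambda\sqrt{V_k/k}} - 1\right)\right] . \qquad (2.10)$$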


Inequality (2.10) may be considered as an exponential Efron-Stein inequality for order statistics: it connects the logarithmic moment generating function of the $k$th order statistic with the exponential moments of the square root of the Efron-Stein estimate of variance $k\Delta_k^2$. This connection provides correct bounds for the exponential distribution whereas the exponential Efron-Stein inequality described in [1] does not. This comes from the fact that negative association between spacing and order statistics leads to an easy decoupling argument: there is no need to resort to the variational representation of entropy as in [1]. It is then possible to carry out the so-called Herbst's argument in an effortless way.
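For the exponential distribution (constant hazard rate), both sides of (2.10) are available in closed form, which gives a concrete check. The sketch below assumes the $k$th spacing is exponential with mean $1/k$, as in Renyi's representation, so that $E[\Delta_k(e^{\lambda\Delta_k} - 1)] = k/(k-\lambda)^2 - 1/k$ for $\lambda < k$; the function names and the values of $n$, $k$, $\lambda$ are illustrative.

```python
import math

def exact_log_mgf(n, k, lam):
    """log E exp(lam * (Y_(k) - E Y_(k))) for the k-th largest of n standard
    exponentials, using Y_(k) = sum_{i=k}^{n} E_i / i (valid for 0 <= lam < k)."""
    return sum(-math.log1p(-lam / i) - lam / i for i in range(k, n + 1))

def exponential_efron_stein_bound(k, lam):
    """Right-hand side of (2.10): lam * (k/2) * E[D (exp(lam D) - 1)], where the
    spacing D is exponential with mean 1/k, so E[...] = k/(k-lam)^2 - 1/k."""
    return lam * (k / 2.0) * (k / (k - lam) ** 2 - 1.0 / k)

n, k = 100, 5
for lam in (0.5, 1.0, 2.0, 4.0):
    print(lam, exact_log_mgf(n, k, lam), exponential_efron_stein_bound(k, lam))
```

For small $\lambda$ the bound behaves like $\lambda^2/k$ while the exact value behaves like $\lambda^2/(2k)$, which matches the factor 2 already visible in the variance bound.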

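Proof. Throughout the proof $Y_{(k)}, Y_{(k+1)}$ denote the $k$th and $(k+1)$th order statistics of an exponential sample of size $n$. By Proposition 2.3, using Renyi's representation (Theorem 2.5), and Proposition 2.7, for $k \le n/2$,

$$\begin{aligned}
\operatorname{Var}[X_{(k)}] &\le k E\left[\left(U(e^{Y_{(k+1)}} e^{Y_{(k)} - Y_{(k+1)}}) - U(e^{Y_{(k+1)}})\right)^2\right] \\
&\le k E\left[\left(\frac{1}{h(U(e^{Y_{(k+1)}}))}\left(Y_{(k)} - Y_{(k+1)}\right)\right)^2\right] \\
&\le \frac{2}{k} E\left[\left(\frac{1}{h(X_{(k+1)})}\right)^2\right]
\end{aligned}$$

as by Theorem 2.5, $Y_{(k)} - Y_{(k+1)}$ is independent of $Y_{(k+1)}$ and exponentially distributed with scale parameter $1/k$.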

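By Propositions 2.3 and 2.8, as $\psi$ is non-decreasing over $\mathbb{R}_+$,

$$\begin{aligned}
\operatorname{Ent}\left[e^{\lambda X_{(k)}}\right] &\le k E\left[e^{\lambda X_{(k+1)}} \psi(\lambda\Delta_k)\right] \\
&\le k E\left[e^{\lambda X_{(k+1)}}\right] \times E[\psi(\lambda\Delta_k)] \\
&\le k E\left[e^{\lambda X_{(k)}}\right] \times E[\psi(\lambda\Delta_k)] .
\end{aligned}$$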

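Multiplying both sides by $\exp(-\lambda E X_{(k)})$,

$$\operatorname{Ent}\left[e^{\lambda(X_{(k)} - EX_{(k)})}\right] \le k E\left[e^{\lambda(X_{(k)} - EX_{(k)})}\right] \times E[\psi(\lambda\Delta_k)] .$$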



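Let $G(\lambda) = E e^{\lambda\Delta_k}$. Obviously, $G(0) = 1$, and as $\Delta_k \ge 0$, $G$ and its derivatives are increasing on $[0, \infty)$,

$$E[\psi(\lambda\Delta_k)] = 1 - G(\lambda) + \lambda G'(\lambda) = \int_0^\lambda s G''(s)\,ds \le G''(\lambda) \frac{\lambda^2}{2} .$$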

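Hence, for $\lambda \ge 0$,

$$\frac{\operatorname{Ent}\left[e^{\lambda(X_{(k)} - EX_{(k)})}\right]}{\lambda^2 E\left[e^{\lambda(X_{(k)} - EX_{(k)})}\right]} = \frac{d}{d\lambda}\left(\frac{1}{\lambda} \log E e^{\lambda(X_{(k)} - EX_{(k)})}\right) \le \frac{k}{2} \frac{dG'}{d\lambda} .$$

Integrating both sides, using the fact that $\lim_{\lambda \to 0} \frac{1}{\lambda} \log E e^{\lambda(X_{(k)} - EX_{(k)})} = 0$,

$$\frac{1}{\lambda} \log E e^{\lambda(X_{(k)} - EX_{(k)})} \le \frac{k}{2}\left(G'(\lambda) - G'(0)\right) = \frac{k}{2} E\left[\Delta_k\left(e^{\lambda\Delta_k} - 1\right)\right] .$$

□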
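The identity between the entropy ratio and the derivative used in the last step of the proof is the usual computation behind Herbst's argument. As a worked step, write $F(\lambda) = E e^{\lambda(X_{(k)} - EX_{(k)})}$; this shorthand is introduced here only for this remark and is not part of the original notation.

```latex
\frac{d}{d\lambda}\left(\frac{1}{\lambda}\log F(\lambda)\right)
  = \frac{\lambda F'(\lambda) - F(\lambda)\log F(\lambda)}{\lambda^{2} F(\lambda)}
  = \frac{\operatorname{Ent}\left[e^{\lambda (X_{(k)} - EX_{(k)})}\right]}{\lambda^{2} F(\lambda)}
  \le \frac{k}{\lambda^{2}}\, E\left[\psi(\lambda \Delta_k)\right]
  \le \frac{k}{2}\, G''(\lambda)
```

The second equality uses $\lambda F'(\lambda) = \lambda E[(X_{(k)} - EX_{(k)}) e^{\lambda(X_{(k)} - EX_{(k)})}]$ together with the definition of entropy; the two inequalities are the entropy bound and the bound on $E[\psi(\lambda\Delta_k)]$ obtained in the proof.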



3. Asymptotic assessment 

Assessing the quality of the variance bounds from Proposition 2.3 in full generality is not easy. However, Extreme Value Theory (EVT) describes a framework where the Efron-Stein estimates of variance are asymptotically of the right order of magnitude.

Definition 3.1. The distribution function $F$ belongs to a maximum domain of attraction with tail index $\gamma \in \mathbb{R}$ ($F \in \mathrm{MDA}(\gamma)$), if and only if there exists a non-negative auxiliary function $a$ on $[1, \infty)$ such that for $x \in [0, \infty)$ (if $\gamma > 0$), $x \in [0, -1/\gamma)$ (if $\gamma < 0$), $x \in \mathbb{R}$ (if $\gamma = 0$)

$$\lim_n P\left\{\frac{\max(X_1, \ldots, X_n) - F^{\leftarrow}(1 - 1/n)}{a(n)} \le x\right\} = \exp\left(-(1 + \gamma x)^{-1/\gamma}\right) .$$

If $\gamma = 0$, $(1 + \gamma x)^{-1/\gamma}$ should be read as $\exp(-x)$.

If $F \in \mathrm{MDA}(\gamma)$ and has finite variance ($\gamma < 1/2$), the variance of $(\max(X_1, \ldots, X_n) - F^{\leftarrow}(1 - 1/n))/a(n)$ converges to the variance of the limiting extreme value distribution [5].

Membership in a maximum domain of attraction is characterized by the extended regular variation property of $U = (1/(1 - F))^{\leftarrow}$: $F \in \mathrm{MDA}(\gamma)$ with auxiliary function $a$ iff for all $x > 0$

$$\lim_{t \to \infty} \frac{U(tx) - U(t)}{a(t)} = \frac{x^\gamma - 1}{\gamma} ,$$

where the right-hand-side should be read as $\log x$ when $\gamma = 0$ [2].

Using Theorem 2.1.1 and Theorem 5.3.1 from [2], and performing simple calculus, we readily obtain

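Proposition 3.2. Assume $X_{(1)} \ge \ldots \ge X_{(n)}$ are the order statistics of an independent sample distributed according to $F$, where $F \in \mathrm{MDA}(\gamma)$, $\gamma < 1/2$, with auxiliary function $a$. Then

$$\lim_n \frac{E\left[(X_{(1)} - X_{(2)})^2\right]}{a(n)^2} = \frac{2\Gamma(2(1 - \gamma))}{(1 - \gamma)(1 - 2\gamma)} \quad \text{while} \quad \lim_n \frac{\operatorname{Var}(X_{(1)})}{a(n)^2} = \frac{1}{\gamma^2}\left(\Gamma(1 - 2\gamma) - \Gamma(1 - \gamma)^2\right) .$$

For $\gamma = 0$, the last expression should be read as $\pi^2/6$.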

The asymptotic ratio between the Efron-Stein upper bound and the variance of $X_{(1)}$ converges toward a limit that depends only on $\gamma$ (for $\gamma = 0$ this limit is $12/\pi^2 \approx 1.21$). When the tail index $\gamma < 0$, the asymptotic ratio degrades as $\gamma \to -\infty$; it scales like $-4\gamma$.
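The two limits of Proposition 3.2 are easy to evaluate with the Gamma function. The sketch below assumes the limits are read as $2\Gamma(2(1-\gamma))/((1-\gamma)(1-2\gamma))$ for the Efron-Stein bound and $(\Gamma(1-2\gamma) - \Gamma(1-\gamma)^2)/\gamma^2$ for the variance; the function names and the sample values of $\gamma$ are illustrative.

```python
import math

def efron_stein_limit(gamma):
    """lim E[(X_(1) - X_(2))^2] / a(n)^2 = 2 Gamma(2(1-gamma)) / ((1-gamma)(1-2 gamma))."""
    return 2.0 * math.gamma(2.0 * (1.0 - gamma)) / ((1.0 - gamma) * (1.0 - 2.0 * gamma))

def variance_limit(gamma):
    """lim Var(X_(1)) / a(n)^2, read as pi^2/6 when gamma = 0."""
    if gamma == 0.0:
        return math.pi ** 2 / 6.0
    return (math.gamma(1.0 - 2.0 * gamma) - math.gamma(1.0 - gamma) ** 2) / gamma ** 2

for gamma in (0.25, 0.1, 0.0, -0.1, -0.25):
    print(gamma, efron_stein_limit(gamma) / variance_limit(gamma))
```

At $\gamma = 0$ the ratio equals $2/(\pi^2/6) = 12/\pi^2$, the value quoted in the text.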

4. Order statistics of Gaussian samples 

We now turn to the Gaussian setting. We will establish Bernstein inequalities 
for order statistics of absolute values of independent Gaussian random variables. 

A real-valued random variable $X$ is said to be sub-gamma on the right tail with variance factor $v$ and scale parameter $c$ if

$$\log E e^{\lambda(X - EX)} \le \frac{\lambda^2 v}{2(1 - c\lambda)} \quad \text{for every } \lambda \text{ such that } 0 < \lambda < 1/c .$$

Such a random variable satisfies a so-called Bernstein inequality: for $t > 0$, $P\{X \ge EX + \sqrt{2vt} + ct\} \le \exp(-t)$. A real-valued random variable $X$ is said to be sub-gamma on the left tail with variance factor $v$ and scale parameter $c$, if $-X$ is sub-gamma on the right tail with variance factor $v$ and scale parameter $c$. A Gamma random variable with shape parameter $p$ and scale parameter $c$ (expectation $pc$ and variance $pc^2$) is sub-gamma on the right tail with variance factor $pc^2$ and scale factor $c$, while it is sub-gamma on the left tail with variance factor $pc^2$ and scale factor 0. The Gumbel distribution (with distribution function $\exp(-\exp(-x))$) is sub-gamma on the right tail with variance factor $\pi^2/6$ and scale factor 1; it is sub-gamma on the left tail with scale factor 0 (note that this statement is not sharp, see Lemma 4.3 below).
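The Gamma example can be verified directly, since the centered logarithmic moment generating function of a Gamma variable with shape $p$ and scale $c$ is $-p\log(1 - c\lambda) - pc\lambda$. The sketch below compares it with the sub-gamma bound and evaluates the Bernstein deviation $\sqrt{2vt} + ct$; the function names and numerical values are illustrative.

```python
import math

def gamma_log_mgf(p, c, lam):
    """log E exp(lam (X - EX)) for X ~ Gamma(shape p, scale c), 0 <= lam < 1/c."""
    return -p * math.log1p(-c * lam) - p * c * lam

def sub_gamma_bound(v, c, lam):
    """lam^2 v / (2 (1 - c lam)), the sub-gamma bound on the right tail."""
    return lam ** 2 * v / (2.0 * (1.0 - c * lam))

def bernstein_quantile(v, c, t):
    """Deviation sqrt(2 v t) + c t exceeded with probability at most exp(-t)."""
    return math.sqrt(2.0 * v * t) + c * t

p, c = 3.0, 0.5
for lam in (0.1, 0.5, 1.0, 1.9):
    print(lam, gamma_log_mgf(p, c, lam), sub_gamma_bound(p * c * c, c, lam))
print(bernstein_quantile(p * c * c, c, 2.0))
```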

Order statistics of Gaussian samples provide an interesting playground for assessing Theorem 2.9. Let $\Phi$ and $\phi$ denote respectively the standard Gaussian distribution function and density. Throughout this section, let $U : ]1, \infty) \to [0, \infty)$ be defined by $U(t) = \Phi^{\leftarrow}(1 - 1/(2t))$; $U(t)$ is the $1 - 1/t$ quantile of the distribution of the absolute value of a standard Gaussian random variable, or the $1 - 1/(2t)$ quantile of the Gaussian distribution.

Proposition 4.1. Absolute values of Gaussians have non-decreasing hazard rate:

i) $U \circ \exp$ is concave;

ii) For $y > 0$, $\phi(U(\exp(y)))/\overline{\Phi}(U(\exp(y))) \ge \sqrt{\kappa_1(y + \log 2)}$ where $\kappa_1 \ge 1/2$ and $\overline{\Phi} = 1 - \Phi$;

iii) For $t \ge 3$,

$$\sqrt{2\log(2t) - \log\log(2t) - \log(4\pi)} \le U(t) \le \sqrt{2\log(2t) - \log\log(2t) - \log\pi} .$$

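Proof. i) As $(U \circ \exp)'(t) = \overline{\Phi}(U(e^t))/\phi(U(e^t))$, it suffices to check that the standard Gaussian distribution has non-decreasing hazard rate on $[0, \infty)$. Let $h = \phi/\overline{\Phi}$; by elementary calculus, for $x > 0$, $h'(x) = (\phi(x) - x\overline{\Phi}(x))\phi(x)/\overline{\Phi}^2(x) \ge 0$, where the last inequality is a well known fact.

ii) For $\kappa_1 = 1/2$, for $p \in (0, 1/2]$, the fact that $p\sqrt{\kappa_1 \log 1/p} \le \phi \circ \Phi^{\leftarrow}(p)$ follows from $\phi(x) - x\overline{\Phi}(x) \ge 0$ for $x > 0$. Hence,

$$\frac{\overline{\Phi}(\Phi^{\leftarrow}(1 - e^{-y}/2))}{\phi(\Phi^{\leftarrow}(1 - e^{-y}/2))} = \frac{e^{-y}/2}{\phi(\Phi^{\leftarrow}(e^{-y}/2))} \le \frac{1}{\sqrt{\kappa_1(\log 2 + y)}} .$$

iii) The first inequality can be deduced from $\phi \circ \Phi^{\leftarrow}(p) \le p\sqrt{2\log 1/p}$, for $p \in (0, 1/2)$ [10], the second from $p\sqrt{\kappa_1 \log 1/p} \le \phi \circ \Phi^{\leftarrow}(p)$. □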
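The two-sided bound on $U(t)$ in item iii) can be compared with the exact Gaussian quantile. The sketch below uses `statistics.NormalDist.inv_cdf`; the helper names `lower` and `upper` and the test points are illustrative choices.

```python
import math
from statistics import NormalDist

def U(t):
    """1 - 1/t quantile of |N(0,1)|, i.e. the 1 - 1/(2t) Gaussian quantile."""
    return NormalDist().inv_cdf(1.0 - 1.0 / (2.0 * t))

def lower(t):
    """sqrt(2 log(2t) - log log(2t) - log(4 pi))."""
    return math.sqrt(2 * math.log(2 * t) - math.log(math.log(2 * t)) - math.log(4 * math.pi))

def upper(t):
    """sqrt(2 log(2t) - log log(2t) - log(pi))."""
    return math.sqrt(2 * math.log(2 * t) - math.log(math.log(2 * t)) - math.log(math.pi))

for t in (3, 10, 100, 1e4, 1e6):
    print(t, lower(t), U(t), upper(t))
```

On these points the exact quantile sits between the two expressions and moves toward the lower one as $t$ grows.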

The next proposition shows that when used in a proper way, Efron-Stein in- 
equalities may provide seamless bounds on extreme, intermediate and central order 
statistics of Gaussian samples. 

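Proposition 4.2. Let $n \ge 3$, let $X_{(1)} \ge \ldots \ge X_{(n)}$ be the order statistics of absolute values of a standard Gaussian sample,

$$\text{For } 1 \le k \le n/2, \quad \operatorname{Var}[X_{(k)}] \le \frac{1}{k \log 2}\, \frac{8}{\log\frac{2n}{k} - \log\left(1 + \frac{4}{k}\log\log\frac{2n}{k}\right)} .$$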

By Theorem 5.3.1 from [2], $\lim_n 2\log n \operatorname{Var}[X_{(1)}] = \pi^2/6$, while the above described upper bound on $\operatorname{Var}[X_{(1)}]$ is equivalent to $(8/\log 2)/\log n$. If $\lim_n k_n = \infty$ while $\lim_n k_n/n = 0$, Smirnov's lemma [2] implies that $\lim_n k(U(n/k))^2 \operatorname{Var}[X_{(k)}] = 1$. For the asymptotically normal median of absolute values, $\lim_n (4\phi(U(2))^2 n) \operatorname{Var}[X_{(n/2)}] = 1$ [11]. Again, the bound in Proposition 4.2 has the correct order of magnitude.
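To get a feeling for the constants, the bound of Proposition 4.2 can be evaluated for concrete sample sizes. The sketch below assumes the bound reads $\frac{1}{k\log 2}\cdot\frac{8}{\log(2n/k) - \log(1 + (4/k)\log\log(2n/k))}$; the function name is hypothetical.

```python
import math

def gaussian_abs_variance_bound(n, k):
    """Upper bound of Proposition 4.2 on Var[X_(k)], 1 <= k <= n/2, n >= 3."""
    log_ratio = math.log(2.0 * n / k)
    denom = log_ratio - math.log(1.0 + (4.0 / k) * math.log(log_ratio))
    if denom <= 0.0:
        raise ValueError("bound is vacuous for these n, k")
    return 8.0 / (k * math.log(2.0) * denom)

for n in (10**3, 10**6, 10**9, 10**12):
    bound = gaussian_abs_variance_bound(n, 1)
    # last column: bound divided by its stated equivalent (8 / log 2) / log n
    print(n, bound, bound * math.log(n) * math.log(2.0) / 8.0)
```

The last column approaches 1 only slowly, so for moderate $n$ the bound is informative about the order of magnitude rather than about the constant.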



Lemma 4.3. Let $Y_{(k)}$ be the $k$th order statistic of a sample of $n$ independent exponential random variables, let $\log 2 < z < \log(n/k)$, then

$$P\left\{Y_{(k+1)} \le \log(n/k) - z\right\} \le \exp\left(-\frac{k(e^z - 1)}{4}\right) .$$
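
Proof.

$$\begin{aligned}
P\left\{Y_{(k+1)} \le \log(n/k) - z\right\} &= \sum_{j=0}^{k} \binom{n}{j} \left(\frac{ke^z}{n}\right)^j \left(1 - \frac{ke^z}{n}\right)^{n-j} \\
&\le \exp\left(-\frac{k(e^z - 1)^2}{2e^z}\right) \\
&\le \exp\left(-\frac{k(e^z - 1)}{4}\right)
\end{aligned}$$

since the right-hand-side of the first line is the probability that a binomial random variable with parameters $n$ and $ke^z/n$ is at most $k$, which is sub-gamma on the left-tail with variance factor less than $ke^z$ and scale factor 0. The last inequality holds because $e^z \ge 2$. □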
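The binomial reading of the event in Lemma 4.3 makes the bound easy to test: $Y_{(k+1)} \le \log(n/k) - z$ holds exactly when at most $k$ of the $n$ exponentials exceed $\log(n/k) - z$, and each does so with probability $ke^z/n$. The sketch below assumes the lemma's bound is $\exp(-k(e^z - 1)/4)$; names and parameter values are illustrative.

```python
import math

def prob_kplus1_below(n, k, z):
    """P{Y_(k+1) <= log(n/k) - z}: at most k of n standard exponentials
    exceed log(n/k) - z, each doing so with probability k e^z / n."""
    p = k * math.exp(z) / n
    return sum(math.comb(n, j) * p ** j * (1.0 - p) ** (n - j) for j in range(k + 1))

def lemma_bound(k, z):
    """exp(-k (e^z - 1) / 4)."""
    return math.exp(-k * (math.exp(z) - 1.0) / 4.0)

n, k = 1000, 5
for z in (0.7, 1.0, 2.0, 3.0):   # all within log 2 < z < log(n/k)
    print(z, prob_kplus1_below(n, k, z), lemma_bound(k, z))
```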

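Proof of Proposition 4.2. By Theorem 2.9 and Proposition 4.1, letting $\kappa_1 = 1/2$,

$$\begin{aligned}
\operatorname{Var}(X_{(k)}) &\le \frac{2}{k} E\left[\frac{2}{\log 2 + Y_{(k+1)}}\right] \\
&\le \frac{4}{k\log 2} P\left\{Y_{(k+1)} \le \log(n/k) - z\right\} + \frac{4}{k}\, \frac{1}{\log\frac{n}{k} - z + \log 2} \\
&\le \frac{4}{k\log 2}\, \frac{1}{\log\frac{2n}{k}} + \frac{4}{k}\, \frac{1}{\log\frac{2n}{k} - \log\left(1 + \frac{4}{k}\log\log\frac{2n}{k}\right)} ,
\end{aligned}$$

where we used Lemma 4.3 with $z = \log\left(1 + \frac{4}{k}\log\log\frac{2n}{k}\right)$. □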

Our next goal is to establish that the order statistics of absolute values of independent Gaussian random variables are sub-gamma on the right-tail with variance factor close to the Efron-Stein estimates of variance derived in Proposition 4.1 and scale factor not larger than the square root of the Efron-Stein estimate of variance.

Before describing the consequences of Theorem 2.9, it is interesting to look at what can be obtained from Renyi's representation and exponential inequalities for sums of Gamma-distributed random variables.

Proposition 4.4. Let $X_{(1)}$ be the maximum of the absolute values of $n$ independent standard Gaussian random variables, and let $U(s) = \Phi^{\leftarrow}(1 - 1/(2s))$ for $s \ge 1$. For $t > 0$,

$$P\left\{X_{(1)} - EX_{(1)} \ge t/(3U(n)) + \sqrt{t}/U(n) + \delta_n\right\} \le \exp(-t) ,$$

where $\delta_n > 0$ and $\lim_n (U(n))^3 \delta_n = \frac{\pi^2}{12}$.



This inequality looks like what we are looking for: $U(n)(X_{(1)} - EX_{(1)})$ converges in distribution, but also in quadratic mean, or even according to the Orlicz norm defined by $x \mapsto \exp(|x|) - 1$, toward a centered Gumbel distribution. The centered Gumbel distribution is sub-gamma on the right tail with variance factor $\pi^2/6$ and scale factor 1, so we expect $X_{(1)}$ to satisfy a Bernstein inequality with variance factor of order $1/U(n)^2$ and scale factor $1/U(n)$. Up to the shift $\delta_n$, this is the content of the proposition. Note that the shift is asymptotically negligible with respect to the typical order of magnitude of the fluctuations. The constants in the next proposition are not sharp enough to make it competitive with Proposition 4.4. Nevertheless it illustrates that Theorem 2.9 captures the correct order of growth for the right-tail of Gaussian maxima.
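
Proposition 4.5. For $n$ such that the solution $v_n$ of equation $16/x + \log(1 + 2/x + 4\log(4/x)) = \log(2n)$ is smaller than 1, for all $0 \le \lambda < \frac{1}{\sqrt{v_n}}$,

$$\log E e^{\lambda(X_{(1)} - EX_{(1)})} \le \frac{v_n \lambda^2}{2(1 - \sqrt{v_n}\lambda)} .$$

For all $t > 0$,

$$P\left\{X_{(1)} - EX_{(1)} > \sqrt{v_n}\left(t + \sqrt{2t}\right)\right\} \le e^{-t} .$$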
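The variance factor $v_n$ is only defined implicitly, so a numerical solver helps to see when the condition "smaller than 1" is met. The sketch below assumes the defining equation reads $16/x + \log(1 + 2/x + 4\log(4/x)) = \log(2n)$, whose left-hand side decreases on $(0, 1]$, and solves it by bisection; the function names are hypothetical.

```python
import math

def equation_lhs(x):
    """16/x + log(1 + 2/x + 4 log(4/x)), decreasing in x on (0, 1]."""
    return 16.0 / x + math.log(1.0 + 2.0 / x + 4.0 * math.log(4.0 / x))

def solve_v_n(n, tol=1e-12):
    """Bisection for the root v_n in (0, 1) of equation_lhs(x) = log(2n);
    returns None when the root is not smaller than 1."""
    target = math.log(2.0 * n)
    if equation_lhs(1.0) >= target:
        return None
    lo, hi = 1e-9, 1.0               # equation_lhs(lo) > target > equation_lhs(hi)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if equation_lhs(mid) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for n in (10**6, 10**9, 10**12):
    print(n, solve_v_n(n))
```

Under this reading the condition $v_n < 1$ requires $\log(2n) > 16 + \log(3 + 4\log 4)$, that is, sample sizes in the tens of millions, which is consistent with the remark that the constants are not sharp.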
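Proof. By Theorem 2.9,

$$\log E e^{\lambda(X_{(1)} - EX_{(1)})} \le \frac{\lambda}{2} E\left[\Delta\left(e^{\lambda\Delta} - 1\right)\right]$$

where $\Delta = X_{(1)} - X_{(2)} \sim U(2e^{Y_{(2)} + E_1}) - U(2e^{Y_{(2)}})$, with $E_1$ exponentially distributed and independent of $Y_{(2)}$, which is distributed like the 2nd largest order statistic of an exponential sample.

On the one hand, the conditional expectation

$$E\left[\left(U(2e^{E_1 + Y_{(2)}}) - U(2e^{Y_{(2)}})\right)\left(e^{\lambda(U(2e^{E_1 + Y_{(2)}}) - U(2e^{Y_{(2)}}))} - 1\right) \mid Y_{(2)}\right]$$

is a non-increasing function of $Y_{(2)}$. The maximum is achieved for $Y_{(2)} = 0$, and it is equal to:

$$2\int_0^\infty \frac{e^{-x^2/2}}{\sqrt{2\pi}}\, x\left(e^{\lambda x} - 1\right) dx \le 2\lambda e^{\lambda^2/2} .$$

On the other hand, by Proposition 4.1,

$$U(2e^{E_1 + Y_{(2)}}) - U(2e^{Y_{(2)}}) \le \frac{\sqrt{2} E_1}{\sqrt{\log 2 + Y_{(2)}}} .$$

Meanwhile, for $0 \le \mu < 1/2$,

$$\int_0^\infty \mu x\left(e^{\mu x} - 1\right) e^{-x} dx = \frac{\mu^2(2 - \mu)}{(1 - \mu)^2} \le \frac{2\mu^2}{1 - 2\mu} .$$

Hence

$$\lambda E\left[\Delta\left(e^{\lambda\Delta} - 1\right) \mid Y_{(2)}\right] \le \frac{4\lambda^2}{\log 2 + Y_{(2)}}\, \frac{1}{1 - \frac{2\sqrt{2}\lambda}{\sqrt{\log 2 + Y_{(2)}}}} .$$

Letting $\tau = \log n - \log(1 + 2\lambda^2 + 4\log(4/v_n))$,

$$\log E e^{\lambda(X_{(1)} - EX_{(1)})} \le \underbrace{\lambda^2 e^{\lambda^2/2} P\{Y_{(2)} \le \tau\}}_{(i)} + \underbrace{\frac{4\lambda^2}{\log 2 + \tau}\, \frac{1}{1 - \frac{2\sqrt{2}\lambda}{\sqrt{\log 2 + \tau}}}}_{(ii)} .$$

By Lemma 4.3, $(i) \le \frac{v_n \lambda^2}{4}$.

As $\lambda \le 1/\sqrt{v_n}$, $\log 2 + \tau \ge 16/v_n$ and $(ii) \le \frac{v_n \lambda^2}{4(1 - \sqrt{v_n}\lambda)}$.

□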



We may also use Theorem 2.9 to provide a Bernstein inequality for the median of absolute values of a Gaussian sample. We assume $n/2$ is an integer.
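
Proposition 4.6. Let $v_n = 8/(n\log 2)$.
For all $0 \le \lambda < \sqrt{n}/(2\sqrt{v_n})$,

$$\log E e^{\lambda(X_{(n/2)} - EX_{(n/2)})} \le \frac{v_n \lambda^2}{2(1 - 2\lambda\sqrt{v_n/n})} .$$

For all $t > 0$,

$$P\left\{X_{(n/2)} - EX_{(n/2)} > \sqrt{2v_n t} + 2\sqrt{v_n/n}\, t\right\} \le e^{-t} .$$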
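For orientation, the deviation in Proposition 4.6 can be set against the asymptotic standard deviation of the median of absolute values, $1/(2\phi(U(2))\sqrt{n})$, recalled after Proposition 4.2. The sketch below reads the scale factor as $2\sqrt{v_n/n}$, which is an interpretation of the garbled display rather than a confirmed formula; function names are hypothetical.

```python
import math
from statistics import NormalDist

def median_deviation(n, t):
    """sqrt(2 v_n t) + 2 sqrt(v_n / n) t with v_n = 8 / (n log 2)."""
    v_n = 8.0 / (n * math.log(2.0))
    return math.sqrt(2.0 * v_n * t) + 2.0 * math.sqrt(v_n / n) * t

def asymptotic_median_sd(n):
    """1 / (2 phi(U(2)) sqrt(n)), where U(2) is the median of |N(0,1)|."""
    std = NormalDist()
    u2 = std.inv_cdf(0.75)
    return 1.0 / (2.0 * std.pdf(u2) * math.sqrt(n))

for n in (100, 10_000):
    print(n, median_deviation(n, 1.0), asymptotic_median_sd(n))
```

Both quantities shrink like $1/\sqrt{n}$, so the ratio between them stays bounded as $n$ grows.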

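Proof. By Theorem 2.9,

$$\log E e^{\lambda(X_{(n/2)} - EX_{(n/2)})} \le \frac{n}{4}\lambda E\left[\Delta_{n/2}\left(e^{\lambda\Delta_{n/2}} - 1\right)\right]$$

where $\Delta_{n/2} = X_{(n/2)} - X_{(n/2+1)} \sim U(2e^{E/(n/2) + Y_{(n/2+1)}}) - U(2e^{Y_{(n/2+1)}})$, where $E$ is exponentially distributed and independent of $Y_{(n/2+1)}$. By Proposition 4.1,

$$\lambda\Delta_{n/2} \le \frac{\lambda\sqrt{2} E}{(n/2)\sqrt{\log 2 + Y_{(n/2+1)}}} \le \frac{\lambda\sqrt{2} E}{(n/2)\sqrt{\log 2}} = \lambda\sqrt{\frac{v_n}{n}}\, E .$$

Reasoning as in the proof of Proposition 4.5,

$$\log E e^{\lambda(X_{(n/2)} - EX_{(n/2)})} \le \frac{v_n \lambda^2}{2(1 - 2\lambda\sqrt{v_n/n})} .$$

□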



As the hazard rate $\phi(x)/\overline{\Phi}(x)$ of the Gaussian distribution tends to 0 as $x$ tends to $-\infty$, the preceding approach does not work when dealing with order statistics of Gaussian samples. Nevertheless, Proposition 4.2 paves the way to simple bounds on the variance of maxima of Gaussian samples.

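Proposition 4.7. Let $X_1, \ldots, X_n$ be $n$ independent and identically distributed standard Gaussian random variables, let $X_{(1)} \ge \ldots \ge X_{(n)}$ be the order statistics.
For all $n \ge 11$,

$$\operatorname{Var}[X_{(1)}] \le \frac{8/\log 2}{\log(n/2) - \log(1 + 4\log\log(n/2))} + 2^{-n} + \exp\left(-\frac{n}{8}\right) + \frac{4\pi}{n} .$$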

Proof of Proposition 4.7. We may generate $n$ independent standard Gaussian random variables in two steps: first generate $n$ independent random signs ($\epsilon_1, \ldots, \epsilon_n$: $P\{\epsilon_i = 1\} = 1 - P\{\epsilon_i = -1\} = 1/2$), then generate absolute values ($V_1, \ldots, V_n$); the resulting sample ($X_1, \ldots, X_n$) is obtained as $X_i = \epsilon_i V_i$. Let $N$ be the number of positive random signs.

$$\operatorname{Var}(X_{(k)}) = \underbrace{E\left[\operatorname{Var}(X_{(k)} \mid \sigma(N))\right]}_{(i)} + \underbrace{\operatorname{Var}\left(E[X_{(k)} \mid \sigma(N)]\right)}_{(ii)} .$$

Conditionally on $N = m$, if $k \le m$, $X_{(k)}$ is distributed as the $k$th order statistic of a sample of $m$ independent absolute values of Gaussian random variables. If $k > m$, $X_{(k)}$ is negative, its conditional variance is equal to the variance of the statistics of order $n - k + 1$ in a sample of size $n - m$. Hence, letting $V_{(k)}^{(m)}$ denote the $k$th order statistic of a sample of $m$ independent absolute values of Gaussian random variables.

STEPHANE BOUCHERON AND MAUD THOMAS 



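For $k = 1$,

$$\begin{aligned}
(i) &= \sum_{m=1}^{n} \binom{n}{m} 2^{-n} \operatorname{Var}\left(V_{(1)}^{(m)}\right) + 2^{-n} \operatorname{Var}\left(V_{(n)}^{(n)}\right) \\
&\le \sum_{m=1}^{n/4} \binom{n}{m} 2^{-n} + \operatorname{Var}\left(V_{(1)}^{(n/4)}\right) + 2^{-n} \\
&\le \exp(-n/8) + \operatorname{Var}\left(V_{(1)}^{(n/4)}\right) + 2^{-n}
\end{aligned}$$

where the middle term can be upper bounded using Proposition 4.2.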



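Let $H_N$ be the random harmonic number $H_N = \sum_{i=1}^{N} 1/i$,

$$(ii) = \frac{1}{2} E_{N,N'}\left[\left(E[X_{(1)} \mid N] - E[X_{(1)} \mid N']\right)^2\right] \le \frac{\pi}{4} E_{N,N'}\left[\left(H_N - H_{N'}\right)^2\right] = \frac{\pi}{2} \operatorname{Var}(H_N) ,$$

where $N'$ is an independent copy of $N$, and the inequality uses the representation of $X_{(1)}$ through $U \circ \exp$ together with $(U \circ \exp)' = 1/h(U \circ \exp) \le \sqrt{\pi/2}$.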



Now, as $N = \sum_{i=1}^n (1 + \epsilon_i)/2$, letting $Z = H_N$ and $Z_i = \sum_{j=1}^{N - (1 + \epsilon_i)/2} 1/j$, by the Efron-Stein inequality, $\operatorname{Var} Z \le E[1 \wedge 1/N]$. Finally, using Hoeffding inequality in a crude way leads to $E[1 \wedge 1/N] \le \exp(-n/8) + 4/n \le 8/n$. We may conclude by $(ii) \le (4\pi)/n$. □
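The crude Hoeffding step can be compared with an exact computation, since $N$ is binomial with parameters $n$ and $1/2$. The sketch below evaluates $E[\mathbf{1}\{N \ge 1\}/N]$, which is the Efron-Stein sum for $Z = H_N$ (each positive sign contributes $1/N^2$), and sets it against $\exp(-n/8) + 4/n$ and $8/n$; the function name is hypothetical.

```python
import math

def expected_inverse_count(n):
    """E[1{N >= 1} / N] for N ~ Binomial(n, 1/2), the number of positive signs."""
    return sum(math.comb(n, m) / m for m in range(1, n + 1)) / 2.0 ** n

for n in (11, 50, 200):
    exact = expected_inverse_count(n)
    print(n, exact, math.exp(-n / 8.0) + 4.0 / n, 8.0 / n)
```

The exact value is close to $2/n$ for large $n$, so the constant 8 leaves room, in line with Remark 4.8.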

Remark 4.8. Trading simplicity for tightness, sharper bounds on $(ii)$ could be derived and show that $(ii) = O(1/(n\log n))$.

Appendix A. Proof of Proposition 4.4

Let $Y_{(1)}$ denote the maximum of a sample of absolute values of independent exponentially distributed random variables, so that $X_{(1)} \sim U(e^{Y_{(1)}})$.

Thanks to the concavity of $U \circ \exp$, and the fact that $\overline{\Phi}(x)/\phi(x) \le 1/x$ for $x > 0$, $U(\exp(Y_{(1)})) - U(\exp(H_n)) \le (Y_{(1)} - H_n)/U(\exp(H_n))$. Now, $Y_{(1)}$ satisfies a Bernstein inequality with variance factor not larger than $\operatorname{Var}(Y_{(1)}) \le \pi^2/6 \le 2$ and scale factor not larger than 1, so

$$P\left\{Y_{(1)} - H_n \ge tU(\exp(H_n))\right\} \le \exp\left(-\frac{t^2 U(\exp(H_n))^2}{2(2 + tU(\exp(H_n))/3)}\right) .$$

Agreeing on $\delta_n = U(e^{H_n}) - EU(e^{Y_{(1)}})$, using the monotonicity of $x^2/(2 + x/3)$ and $\log n \le H_n$, we obtain the first part of the proposition.

The second-order extended regular variation condition satisfied by the Gaussian distribution (see [2] Exercise 2.9 p. 61) allows us to assert $U(n)^3\left(U(e^x n) - U(n) - \frac{x}{U(n)}\right) \to -\frac{x^2}{2} - x$. This suggests that the order of magnitude of $U(e^{H_n}) - EX_{(1)}$ is $O(1/U(n)^3)$, which is small with respect to $1/U(n)$.

Use Theorem B.3.10 from [2] to ensure that for all $\delta, \epsilon > 0$, for $n, x$ such that $\min(n, n\exp(x)) \ge t_0(\epsilon, \delta)$,

$$\left| U(n)^3\left(U(e^x n) - U(n) - \frac{x}{U(n)}\right) + \frac{x^2}{2} + x \right| \le \epsilon \exp(\delta|x|) .$$

The probability that $Y_{(1)} \le H_n/2$ is less than $\exp(-H_n/3)$. When $Y_{(1)} - H_n \le 0$, the supremum of […] is achieved when […]; it is less than $4(\log n)^2$. The dominated convergence theorem allows us to conclude.